LSBATCH: A Distributed Load Sharing Batch System
نویسندگان
چکیده
Batch processing, a primary mode of computing in mainframes and supercomputers, is becoming important for networked systems as the computing environments become more and more distributed. In this paper, we discuss the architectural and design considerations , and some important implementation issues of Lsbatch, a distributed batch system for large{scale, heterogeneous computer systems. Lsbatch supports batched submission and execution of parallel as well as sequential jobs in a system of up to several thousand hosts with possibly diierent architectures, Unix operating system varieties, and power. Implemented on top of the Utopia network operating system 5] as a distributed utility, Lsbatch takes advantage of the rich set of resource and load information services and eecient remote execution mechanisms of Utopia and provides scheduling algorithms to fully utilize the computing resources scattered around a distributed system. To the user, Lsbatch appears very much like a exible single host batch system; the complexities of distributed and heterogeneous computers and system failures are hidden in the implementation. To satisfy real world constraints to resource sharing, a variety of connguration mechanisms are provided in Lsbatch to allow sophisticated site management policies to be established and enforced.
منابع مشابه
Voltage Control and Load Sharing in a DC Islanded Microgrid Based on Disturbance Observer
Increasing DC loads along with DC nature of distributed energy resources (DERs) raises interest to DC microgrids. Conventional droop/non-droop power-sharing in microgrids suffers from load dependent voltage deviation, slow transient response, and requires the parameters of the loads, system and DERs connection status. In this paper, a new nonlinear decentralized back-stepping control strategy f...
متن کاملA Distributed Control Architecture for Autonomous Operation of a Hybrid AC/DC Microgrid System
Hybrid AC/DC microgrids facilitate the procedure of DC power connection into the conventional AC power system by developing the distributed generations (DGs) technologies. The conversion processes between AC and DC electrical powers are more convenient by hybrid systems. In this paper, an energy management system (EMS) for a hybrid microgrid network is proposed due to the optimal utilization of...
متن کاملA worldwide flock of Condors: Load sharing among workstation clusters
Condor is a distributed batch system for sharing the workload of compute-intensive jobs in a pool of Unix workstations connected by a network. In such a Condor pool, idle machines are spotted by Condor and allocated to queued jobs, thus putting otherwise unutilized capacity to e cient use. When institutions owning Condor pools cooperate, they may wish to exploit the joint capacity of their pool...
متن کاملUtopia: a Load Sharing Facility for Large, Heterogeneous Distributed Computer Systems
Load sharing in large heterogeneous distributed systems allows users to access vast amount of computing resources scattered around the system and may provide substantial performance improvements to applications We discuss the design and implementation issues in Utopia a load sharing facility speci cally built for large and heterogeneous systems The system has no restriction on the types of task...
متن کاملHierarchical Load Sharing Policies for Distributed Systems
− Performance of distributed systems can be improved by load sharing. Dynamic load sharing policies take the system state into account in making load distribution decisions. The system state information can be collected in a distributed manner or by a single central controller node. In the distributed scheme, each node gathers the current system state information before making a decision on loa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1993